Overview
Brought to you by YData
Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 999 |
| Missing cells | 428 |
| Missing cells (%) | 2.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 760.9 KiB |
| Average record size in memory | 780.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Text | 8 |
| Categorical | 1 |
Rating is highly overall correlated with Unnamed: 0 | High correlation |
Revenue is highly overall correlated with Votes | High correlation |
Unnamed: 0 is highly overall correlated with Rating | High correlation |
Votes is highly overall correlated with Revenue | High correlation |
Certificate has 101 (10.1%) missing values | Missing |
scoreAvg has 157 (15.7%) missing values | Missing |
Revenue has 169 (16.9%) missing values | Missing |
Unnamed: 0 is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
Overview has unique values | Unique |
Reproduction
| Analysis started | 2025-09-02 13:50:10.382906 |
|---|---|
| Analysis finished | 2025-09-02 13:50:39.836286 |
| Duration | 29.45 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
Unnamed: 0
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 999 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 500 |
| Minimum | 1 |
|---|---|
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 50.9 |
| Q1 | 250.5 |
| median | 500 |
| Q3 | 749.5 |
| 95-th percentile | 949.1 |
| Maximum | 999 |
| Range | 998 |
| Interquartile range (IQR) | 499 |
Descriptive statistics
| Standard deviation | 288.53076 |
|---|---|
| Coefficient of variation (CV) | 0.57706152 |
| Kurtosis | -1.2 |
| Mean | 500 |
| Median Absolute Deviation (MAD) | 250 |
| Skewness | 0 |
| Sum | 499500 |
| Variance | 83250 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 999 | 1 | 0.1% |
| 1 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 3 | 1 | 0.1% |
| 4 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| 6 | 1 | 0.1% |
| 983 | 1 | 0.1% |
| 982 | 1 | 0.1% |
| 981 | 1 | 0.1% |
| Other values (989) | 989 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 999 | 1 | |
| 998 | 1 | |
| 997 | 1 | |
| 996 | 1 | |
| 995 | 1 | |
| 994 | 1 | |
| 993 | 1 | |
| 992 | 1 | |
| 991 | 1 | |
| 990 | 1 |
Title
Text
| Distinct | 998 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 73.0 KiB |
Length
| Max length | 68 |
|---|---|
| Median length | 41 |
| Mean length | 15.443443 |
| Min length | 2 |
Unique
| Unique | 997 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | The Godfather |
|---|---|
| 2nd row | The Dark Knight |
| 3rd row | The Godfather: Part II |
| 4th row | 12 Angry Men |
| 5th row | The Lord of the Rings: The Return of the King |
| Value | Count | Frequency (%) |
| the | 274 | 9.8% |
| of | 86 | 3.1% |
| a | 32 | 1.2% |
| and | 28 | 1.0% |
| no | 24 | 0.9% |
| la | 23 | 0.8% |
| in | 22 | 0.8% |
| to | 18 | 0.6% |
| de | 17 | 0.6% |
| man | 17 | 0.6% |
| Other values (1664) | 2241 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1783 | 11.6% | |
| e | 1425 | 9.2% |
| a | 1126 | 7.3% |
| o | 965 | 6.3% |
| n | 921 | 6.0% |
| i | 861 | 5.6% |
| r | 816 | 5.3% |
| t | 755 | 4.9% |
| h | 564 | 3.7% |
| s | 562 | 3.6% |
| Other values (90) | 5650 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11162 | |
| Uppercase Letter | 2191 | 14.2% |
| Space Separator | 1783 | 11.6% |
| Other Punctuation | 177 | 1.1% |
| Decimal Number | 79 | 0.5% |
| Dash Punctuation | 31 | 0.2% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Other Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1425 | |
| a | 1126 | |
| o | 965 | 8.6% |
| n | 921 | 8.3% |
| i | 861 | 7.7% |
| r | 816 | 7.3% |
| t | 755 | 6.8% |
| h | 564 | 5.1% |
| s | 562 | 5.0% |
| l | 514 | 4.6% |
| Other values (38) | 2653 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 283 | 12.9% |
| S | 187 | 8.5% |
| B | 157 | 7.2% |
| M | 139 | 6.3% |
| D | 129 | 5.9% |
| L | 119 | 5.4% |
| A | 113 | 5.2% |
| C | 101 | 4.6% |
| H | 98 | 4.5% |
| P | 97 | 4.4% |
| Other values (18) | 768 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 23 | |
| 1 | 15 | |
| 0 | 11 | |
| 3 | 8 | 10.1% |
| 4 | 5 | 6.3% |
| 7 | 5 | 6.3% |
| 9 | 4 | 5.1% |
| 5 | 4 | 5.1% |
| 8 | 2 | 2.5% |
| 6 | 2 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 62 | |
| . | 47 | |
| ' | 32 | |
| , | 16 | 9.0% |
| ! | 7 | 4.0% |
| & | 6 | 3.4% |
| ? | 3 | 1.7% |
| / | 3 | 1.7% |
| · | 1 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 1783 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 31 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13353 | |
| Common | 2075 | 13.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1425 | 10.7% |
| a | 1126 | 8.4% |
| o | 965 | 7.2% |
| n | 921 | 6.9% |
| i | 861 | 6.4% |
| r | 816 | 6.1% |
| t | 755 | 5.7% |
| h | 564 | 4.2% |
| s | 562 | 4.2% |
| l | 514 | 3.8% |
| Other values (66) | 4844 |
Common
| Value | Count | Frequency (%) |
| 1783 | ||
| : | 62 | 3.0% |
| . | 47 | 2.3% |
| ' | 32 | 1.5% |
| - | 31 | 1.5% |
| 2 | 23 | 1.1% |
| , | 16 | 0.8% |
| 1 | 15 | 0.7% |
| 0 | 11 | 0.5% |
| 3 | 8 | 0.4% |
| Other values (14) | 47 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15362 | |
| None | 66 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1783 | 11.6% | |
| e | 1425 | 9.3% |
| a | 1126 | 7.3% |
| o | 965 | 6.3% |
| n | 921 | 6.0% |
| i | 861 | 5.6% |
| r | 816 | 5.3% |
| t | 755 | 4.9% |
| h | 564 | 3.7% |
| s | 562 | 3.7% |
| Other values (64) | 5584 |
None
| Value | Count | Frequency (%) |
| ô | 14 | |
| é | 6 | 9.1% |
| û | 5 | 7.6% |
| è | 5 | 7.6% |
| â | 5 | 7.6% |
| ä | 4 | 6.1% |
| î | 2 | 3.0% |
| ù | 2 | 3.0% |
| ü | 2 | 3.0% |
| á | 2 | 3.0% |
| Other values (16) | 19 |
Year
Real number (ℝ)
| Distinct | 99 |
|---|---|
| Distinct (%) | 9.9% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1991.2144 |
| Minimum | 1920 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 1920 |
|---|---|
| 5-th percentile | 1944 |
| Q1 | 1976 |
| median | 1999 |
| Q3 | 2009 |
| 95-th percentile | 2017 |
| Maximum | 2020 |
| Range | 100 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 23.308539 |
|---|---|
| Coefficient of variation (CV) | 0.01170569 |
| Kurtosis | -0.02478235 |
| Mean | 1991.2144 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.93854006 |
| Sum | 1987232 |
| Variance | 543.28798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 32 | 3.2% |
| 2004 | 31 | 3.1% |
| 2009 | 29 | 2.9% |
| 2013 | 28 | 2.8% |
| 2016 | 28 | 2.8% |
| 2001 | 27 | 2.7% |
| 2006 | 26 | 2.6% |
| 2007 | 26 | 2.6% |
| 2015 | 25 | 2.5% |
| 2012 | 24 | 2.4% |
| Other values (89) | 722 |
| Value | Count | Frequency (%) |
| 1920 | 1 | 0.1% |
| 1921 | 1 | 0.1% |
| 1922 | 1 | 0.1% |
| 1924 | 1 | 0.1% |
| 1925 | 2 | |
| 1926 | 1 | 0.1% |
| 1927 | 2 | |
| 1928 | 2 | |
| 1930 | 1 | 0.1% |
| 1931 | 3 |
| Value | Count | Frequency (%) |
| 2020 | 6 | 0.6% |
| 2019 | 23 | |
| 2018 | 19 | |
| 2017 | 22 | |
| 2016 | 28 | |
| 2015 | 25 | |
| 2014 | 32 | |
| 2013 | 28 | |
| 2012 | 24 | |
| 2011 | 18 |
Certificate
Categorical
Missing 
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 101 |
| Missing (%) | 10.1% |
| Memory size | 2.6 KiB |
| U | |
|---|---|
| A | |
| UA | |
| R | |
| PG-13 | |
| Other values (11) |
Length
| Max length | 8 |
|---|---|
| Median length | 1 |
| Mean length | 1.7371938 |
| Min length | 1 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | A |
|---|---|
| 2nd row | UA |
| 3rd row | A |
| 4th row | U |
| 5th row | U |
Common Values
| Value | Count | Frequency (%) |
| U | 234 | |
| A | 196 | |
| UA | 175 | |
| R | 146 | |
| PG-13 | 43 | 4.3% |
| PG | 37 | 3.7% |
| Passed | 34 | 3.4% |
| G | 12 | 1.2% |
| Approved | 11 | 1.1% |
| TV-PG | 3 | 0.3% |
| Other values (6) | 7 | 0.7% |
| (Missing) | 101 |
Length
| Value | Count | Frequency (%) |
| u | 234 | |
| a | 196 | |
| ua | 175 | |
| r | 146 | |
| pg-13 | 43 | 4.8% |
| pg | 37 | 4.1% |
| passed | 34 | 3.8% |
| g | 12 | 1.3% |
| approved | 11 | 1.2% |
| tv-pg | 3 | 0.3% |
| Other values (6) | 7 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 9.4% |
| P | 119 | 7.6% |
| G | 97 | 6.2% |
| s | 68 | 4.4% |
| - | 48 | 3.1% |
| e | 46 | 2.9% |
| d | 46 | 2.9% |
| 1 | 45 | 2.9% |
| Other values (14) | 150 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1168 | |
| Lowercase Letter | 253 | 16.2% |
| Decimal Number | 90 | 5.8% |
| Dash Punctuation | 48 | 3.1% |
| Other Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 68 | |
| e | 46 | |
| d | 46 | |
| a | 35 | |
| p | 22 | 8.7% |
| r | 12 | 4.7% |
| o | 11 | 4.3% |
| v | 11 | 4.3% |
| n | 1 | 0.4% |
| t | 1 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 12.5% |
| P | 119 | 10.2% |
| G | 97 | 8.3% |
| T | 5 | 0.4% |
| V | 5 | 0.4% |
| M | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 45 | |
| 3 | 43 | |
| 6 | 1 | 1.1% |
| 4 | 1 | 1.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 48 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1421 | |
| Common | 139 | 8.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 10.3% |
| P | 119 | 8.4% |
| G | 97 | 6.8% |
| s | 68 | 4.8% |
| e | 46 | 3.2% |
| d | 46 | 3.2% |
| a | 35 | 2.5% |
| p | 22 | 1.5% |
| Other values (8) | 47 | 3.3% |
Common
| Value | Count | Frequency (%) |
| - | 48 | |
| 1 | 45 | |
| 3 | 43 | |
| 6 | 1 | 0.7% |
| 4 | 1 | 0.7% |
| / | 1 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 9.4% |
| P | 119 | 7.6% |
| G | 97 | 6.2% |
| s | 68 | 4.4% |
| - | 48 | 3.1% |
| e | 46 | 2.9% |
| d | 46 | 2.9% |
| 1 | 45 | 2.9% |
| Other values (14) | 150 | 9.6% |
Runtime
Real number (ℝ)
| Distinct | 140 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 122.87187 |
| Minimum | 45 |
|---|---|
| Maximum | 321 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 45 |
|---|---|
| 5-th percentile | 87 |
| Q1 | 103 |
| median | 119 |
| Q3 | 137 |
| 95-th percentile | 178 |
| Maximum | 321 |
| Range | 276 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 28.101227 |
|---|---|
| Coefficient of variation (CV) | 0.2287035 |
| Kurtosis | 3.4289066 |
| Mean | 122.87187 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 1.2098771 |
| Sum | 122749 |
| Variance | 789.67896 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 23 | 2.3% |
| 130 | 23 | 2.3% |
| 129 | 22 | 2.2% |
| 101 | 22 | 2.2% |
| 113 | 22 | 2.2% |
| 110 | 20 | 2.0% |
| 122 | 20 | 2.0% |
| 108 | 19 | 1.9% |
| 102 | 18 | 1.8% |
| 96 | 17 | 1.7% |
| Other values (130) | 793 |
| Value | Count | Frequency (%) |
| 45 | 1 | 0.1% |
| 64 | 1 | 0.1% |
| 67 | 1 | 0.1% |
| 68 | 1 | 0.1% |
| 69 | 1 | 0.1% |
| 70 | 1 | 0.1% |
| 71 | 2 | |
| 72 | 2 | |
| 75 | 2 | |
| 76 | 3 |
| Value | Count | Frequency (%) |
| 321 | 1 | |
| 242 | 1 | |
| 238 | 1 | |
| 229 | 1 | |
| 228 | 1 | |
| 224 | 1 | |
| 220 | 1 | |
| 212 | 1 | |
| 210 | 1 | |
| 209 | 1 |
Genre
Text
| Distinct | 202 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.3 KiB |
Length
| Max length | 29 |
|---|---|
| Median length | 24 |
| Mean length | 19.077077 |
| Min length | 5 |
Unique
| Unique | 72 ? |
|---|---|
| Unique (%) | 7.2% |
Sample
| 1st row | Crime, Drama |
|---|---|
| 2nd row | Action, Crime, Drama |
| 3rd row | Crime, Drama |
| 4th row | Crime, Drama |
| 5th row | Action, Adventure, Drama |
| Value | Count | Frequency (%) |
| drama | 723 | |
| comedy | 233 | 9.2% |
| crime | 209 | 8.2% |
| adventure | 196 | 7.7% |
| action | 189 | 7.4% |
| thriller | 137 | 5.4% |
| romance | 125 | 4.9% |
| biography | 109 | 4.3% |
| mystery | 99 | 3.9% |
| animation | 82 | 3.2% |
| Other values (11) | 438 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2018 | 10.6% |
| r | 1871 | 9.8% |
| , | 1541 | 8.1% |
| 1541 | 8.1% | |
| m | 1447 | 7.6% |
| e | 1235 | 6.5% |
| i | 1144 | 6.0% |
| o | 896 | 4.7% |
| n | 760 | 4.0% |
| t | 727 | 3.8% |
| Other values (23) | 5878 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13264 | |
| Uppercase Letter | 2626 | 13.8% |
| Other Punctuation | 1541 | 8.1% |
| Space Separator | 1541 | 8.1% |
| Dash Punctuation | 86 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2018 | |
| r | 1871 | |
| m | 1447 | |
| e | 1235 | |
| i | 1144 | |
| o | 896 | |
| n | 760 | 5.7% |
| t | 727 | 5.5% |
| y | 718 | 5.4% |
| c | 433 | 3.3% |
| Other values (8) | 2015 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 723 | |
| A | 467 | |
| C | 442 | |
| F | 208 | 7.9% |
| M | 151 | 5.8% |
| T | 137 | 5.2% |
| R | 125 | 4.8% |
| B | 109 | 4.2% |
| H | 88 | 3.4% |
| S | 86 | 3.3% |
| Other values (2) | 90 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1541 |
Space Separator
| Value | Count | Frequency (%) |
| 1541 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15890 | |
| Common | 3168 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2018 | |
| r | 1871 | |
| m | 1447 | 9.1% |
| e | 1235 | 7.8% |
| i | 1144 | 7.2% |
| o | 896 | 5.6% |
| n | 760 | 4.8% |
| t | 727 | 4.6% |
| D | 723 | 4.6% |
| y | 718 | 4.5% |
| Other values (20) | 4351 |
Common
| Value | Count | Frequency (%) |
| , | 1541 | |
| 1541 | ||
| - | 86 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19058 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2018 | 10.6% |
| r | 1871 | 9.8% |
| , | 1541 | 8.1% |
| 1541 | 8.1% | |
| m | 1447 | 7.6% |
| e | 1235 | 6.5% |
| i | 1144 | 6.0% |
| o | 896 | 4.7% |
| n | 760 | 4.0% |
| t | 727 | 3.8% |
| Other values (23) | 5878 |
Rating
Real number (ℝ)
High correlation 
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.9479479 |
| Minimum | 7.6 |
|---|---|
| Maximum | 9.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 7.6 |
|---|---|
| 5-th percentile | 7.6 |
| Q1 | 7.7 |
| median | 7.9 |
| Q3 | 8.1 |
| 95-th percentile | 8.5 |
| Maximum | 9.2 |
| Range | 1.6 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.27228951 |
|---|---|
| Coefficient of variation (CV) | 0.034259096 |
| Kurtosis | 1.0583968 |
| Mean | 7.9479479 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 0.94669269 |
| Sum | 7940 |
| Variance | 0.074141576 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 7.7 | 157 | |
| 7.8 | 151 | |
| 8 | 141 | |
| 8.1 | 127 | |
| 7.6 | 123 | |
| 7.9 | 106 | |
| 8.2 | 67 | |
| 8.3 | 44 | 4.4% |
| 8.4 | 31 | 3.1% |
| 8.5 | 20 | 2.0% |
| Other values (6) | 32 | 3.2% |
| Value | Count | Frequency (%) |
| 7.6 | 123 | |
| 7.7 | 157 | |
| 7.8 | 151 | |
| 7.9 | 106 | |
| 8 | 141 | |
| 8.1 | 127 | |
| 8.2 | 67 | |
| 8.3 | 44 | 4.4% |
| 8.4 | 31 | 3.1% |
| 8.5 | 20 | 2.0% |
| Value | Count | Frequency (%) |
| 9.2 | 1 | 0.1% |
| 9 | 3 | 0.3% |
| 8.9 | 3 | 0.3% |
| 8.8 | 5 | 0.5% |
| 8.7 | 5 | 0.5% |
| 8.6 | 15 | 1.5% |
| 8.5 | 20 | 2.0% |
| 8.4 | 31 | |
| 8.3 | 44 | |
| 8.2 | 67 |
Overview
Text
Unique 
| Distinct | 999 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 201.4 KiB |
Length
| Max length | 313 |
|---|---|
| Median length | 197 |
| Mean length | 146.28328 |
| Min length | 40 |
Unique
| Unique | 999 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | An organized crime dynasty's aging patriarch transfers control of his clandestine empire to his reluctant son. |
|---|---|
| 2nd row | When the menace known as the Joker wreaks havoc and chaos on the people of Gotham, Batman must accept one of the greatest psychological and physical tests of his ability to fight injustice. |
| 3rd row | The early life and career of Vito Corleone in 1920s New York City is portrayed, while his son, Michael, expands and tightens his grip on the family crime syndicate. |
| 4th row | A jury holdout attempts to prevent a miscarriage of justice by forcing his colleagues to reconsider the evidence. |
| 5th row | Gandalf and Aragorn lead the World of Men against Sauron's army to draw his gaze from Frodo and Sam as they approach Mount Doom with the One Ring. |
| Value | Count | Frequency (%) |
| a | 1609 | 6.4% |
| the | 1206 | 4.8% |
| to | 803 | 3.2% |
| of | 777 | 3.1% |
| and | 696 | 2.8% |
| in | 565 | 2.3% |
| his | 516 | 2.1% |
| an | 291 | 1.2% |
| is | 245 | 1.0% |
| with | 242 | 1.0% |
| Other values (5878) | 18034 |
Most occurring characters
| Value | Count | Frequency (%) |
| 23999 | ||
| e | 13867 | 9.5% |
| a | 9800 | 6.7% |
| t | 9329 | 6.4% |
| i | 8842 | 6.1% |
| n | 8580 | 5.9% |
| o | 8559 | 5.9% |
| r | 8202 | 5.6% |
| s | 7965 | 5.5% |
| h | 5625 | 3.8% |
| Other values (76) | 41369 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114964 | |
| Space Separator | 24000 | 16.4% |
| Uppercase Letter | 3515 | 2.4% |
| Other Punctuation | 2721 | 1.9% |
| Decimal Number | 509 | 0.3% |
| Dash Punctuation | 395 | 0.3% |
| Open Punctuation | 13 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
| Currency Symbol | 4 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 13867 | |
| a | 9800 | 8.5% |
| t | 9329 | 8.1% |
| i | 8842 | 7.7% |
| n | 8580 | 7.5% |
| o | 8559 | 7.4% |
| r | 8202 | 7.1% |
| s | 7965 | 6.9% |
| h | 5625 | 4.9% |
| l | 4847 | 4.2% |
| Other values (23) | 29348 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 712 | |
| T | 258 | 7.3% |
| I | 258 | 7.3% |
| W | 228 | 6.5% |
| S | 223 | 6.3% |
| B | 176 | 5.0% |
| M | 167 | 4.8% |
| C | 158 | 4.5% |
| H | 139 | 4.0% |
| R | 119 | 3.4% |
| Other values (17) | 1077 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 117 | |
| 0 | 104 | |
| 9 | 94 | |
| 2 | 43 | 8.4% |
| 6 | 33 | 6.5% |
| 7 | 30 | 5.9% |
| 5 | 26 | 5.1% |
| 8 | 23 | 4.5% |
| 4 | 21 | 4.1% |
| 3 | 18 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1278 | |
| , | 1082 | |
| ' | 260 | 9.6% |
| " | 60 | 2.2% |
| : | 16 | 0.6% |
| ? | 11 | 0.4% |
| / | 8 | 0.3% |
| ; | 6 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 23999 | ||
| 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 395 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 4 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 2 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 118479 | |
| Common | 27658 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 13867 | |
| a | 9800 | 8.3% |
| t | 9329 | 7.9% |
| i | 8842 | 7.5% |
| n | 8580 | 7.2% |
| o | 8559 | 7.2% |
| r | 8202 | 6.9% |
| s | 7965 | 6.7% |
| h | 5625 | 4.7% |
| l | 4847 | 4.1% |
| Other values (50) | 32863 |
Common
| Value | Count | Frequency (%) |
| 23999 | ||
| . | 1278 | 4.6% |
| , | 1082 | 3.9% |
| - | 395 | 1.4% |
| ' | 260 | 0.9% |
| 1 | 117 | 0.4% |
| 0 | 104 | 0.4% |
| 9 | 94 | 0.3% |
| " | 60 | 0.2% |
| 2 | 43 | 0.2% |
| Other values (16) | 226 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 146116 | |
| None | 21 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 23999 | ||
| e | 13867 | 9.5% |
| a | 9800 | 6.7% |
| t | 9329 | 6.4% |
| i | 8842 | 6.1% |
| n | 8580 | 5.9% |
| o | 8559 | 5.9% |
| r | 8202 | 5.6% |
| s | 7965 | 5.5% |
| h | 5625 | 3.8% |
| Other values (65) | 41348 |
None
| Value | Count | Frequency (%) |
| é | 9 | |
| » | 2 | 9.5% |
| è | 2 | 9.5% |
| ü | 1 | 4.8% |
| 1 | 4.8% | |
| ä | 1 | 4.8% |
| ç | 1 | 4.8% |
| « | 1 | 4.8% |
| ö | 1 | 4.8% |
| É | 1 | 4.8% |
scoreAvg
Real number (ℝ)
Missing 
| Distinct | 63 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 157 |
| Missing (%) | 15.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.969121 |
| Minimum | 28 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 28 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 70 |
| median | 79 |
| Q3 | 87 |
| 95-th percentile | 96 |
| Maximum | 100 |
| Range | 72 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.383257 |
|---|---|
| Coefficient of variation (CV) | 0.15882258 |
| Kurtosis | 0.41651678 |
| Mean | 77.969121 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.60431623 |
| Sum | 65650 |
| Variance | 153.34506 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 76 | 32 | 3.2% |
| 84 | 29 | 2.9% |
| 90 | 29 | 2.9% |
| 86 | 27 | 2.7% |
| 72 | 27 | 2.7% |
| 73 | 27 | 2.7% |
| 85 | 27 | 2.7% |
| 77 | 26 | 2.6% |
| 80 | 26 | 2.6% |
| 81 | 26 | 2.6% |
| Other values (53) | 566 | |
| (Missing) | 157 | 15.7% |
| Value | Count | Frequency (%) |
| 28 | 1 | 0.1% |
| 30 | 1 | 0.1% |
| 33 | 1 | 0.1% |
| 36 | 1 | 0.1% |
| 40 | 1 | 0.1% |
| 41 | 1 | 0.1% |
| 44 | 1 | 0.1% |
| 45 | 3 | |
| 46 | 1 | 0.1% |
| 47 | 4 |
| Value | Count | Frequency (%) |
| 100 | 12 | |
| 99 | 4 | 0.4% |
| 98 | 9 | |
| 97 | 12 | |
| 96 | 18 | |
| 95 | 11 | |
| 94 | 20 | |
| 93 | 14 | |
| 92 | 13 | |
| 91 | 19 |
Director
Text
| Distinct | 548 |
|---|---|
| Distinct (%) | 54.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.7 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 22 |
| Mean length | 13.485485 |
| Min length | 7 |
Unique
| Unique | 353 ? |
|---|---|
| Unique (%) | 35.3% |
Sample
| 1st row | Francis Ford Coppola |
|---|---|
| 2nd row | Christopher Nolan |
| 3rd row | Francis Ford Coppola |
| 4th row | Sidney Lumet |
| 5th row | Peter Jackson |
| Value | Count | Frequency (%) |
| john | 34 | 1.6% |
| david | 28 | 1.4% |
| james | 23 | 1.1% |
| robert | 20 | 1.0% |
| martin | 16 | 0.8% |
| richard | 15 | 0.7% |
| lee | 15 | 0.7% |
| george | 14 | 0.7% |
| steven | 14 | 0.7% |
| alfred | 14 | 0.7% |
| Other values (882) | 1879 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1209 | 9.0% |
| a | 1126 | 8.4% |
| 1073 | 8.0% | |
| n | 950 | 7.1% |
| r | 917 | 6.8% |
| o | 851 | 6.3% |
| i | 834 | 6.2% |
| l | 543 | 4.0% |
| s | 497 | 3.7% |
| t | 433 | 3.2% |
| Other values (59) | 5039 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10223 | |
| Uppercase Letter | 2107 | 15.6% |
| Space Separator | 1073 | 8.0% |
| Other Punctuation | 43 | 0.3% |
| Dash Punctuation | 26 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1209 | |
| a | 1126 | |
| n | 950 | 9.3% |
| r | 917 | 9.0% |
| o | 851 | 8.3% |
| i | 834 | 8.2% |
| l | 543 | 5.3% |
| s | 497 | 4.9% |
| t | 433 | 4.2% |
| h | 404 | 4.0% |
| Other values (26) | 2459 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 179 | 8.5% |
| A | 171 | 8.1% |
| M | 166 | 7.9% |
| J | 162 | 7.7% |
| C | 142 | 6.7% |
| R | 131 | 6.2% |
| H | 110 | 5.2% |
| B | 106 | 5.0% |
| T | 102 | 4.8% |
| D | 99 | 4.7% |
| Other values (19) | 739 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 41 | |
| ' | 2 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| 1073 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12330 | |
| Common | 1142 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1209 | 9.8% |
| a | 1126 | 9.1% |
| n | 950 | 7.7% |
| r | 917 | 7.4% |
| o | 851 | 6.9% |
| i | 834 | 6.8% |
| l | 543 | 4.4% |
| s | 497 | 4.0% |
| t | 433 | 3.5% |
| h | 404 | 3.3% |
| Other values (55) | 4566 |
Common
| Value | Count | Frequency (%) |
| 1073 | ||
| . | 41 | 3.6% |
| - | 26 | 2.3% |
| ' | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13421 | |
| None | 51 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1209 | 9.0% |
| a | 1126 | 8.4% |
| 1073 | 8.0% | |
| n | 950 | 7.1% |
| r | 917 | 6.8% |
| o | 851 | 6.3% |
| i | 834 | 6.2% |
| l | 543 | 4.0% |
| s | 497 | 3.7% |
| t | 433 | 3.2% |
| Other values (46) | 4988 |
None
| Value | Count | Frequency (%) |
| ó | 10 | |
| á | 9 | |
| é | 8 | |
| ñ | 7 | |
| ô | 5 | |
| ö | 3 | 5.9% |
| ç | 2 | 3.9% |
| Ö | 2 | 3.9% |
| Ô | 1 | 2.0% |
| Ç | 1 | 2.0% |
| Other values (3) | 3 | 5.9% |
Star1
Text
| Distinct | 659 |
|---|---|
| Distinct (%) | 66.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.3 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 13.005005 |
| Min length | 4 |
Unique
| Unique | 502 ? |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Marlon Brando |
|---|---|
| 2nd row | Christian Bale |
| 3rd row | Al Pacino |
| 4th row | Henry Fonda |
| 5th row | Elijah Wood |
| Value | Count | Frequency (%) |
| tom | 22 | 1.1% |
| daniel | 17 | 0.8% |
| robert | 17 | 0.8% |
| john | 16 | 0.8% |
| khan | 16 | 0.8% |
| james | 15 | 0.7% |
| michael | 12 | 0.6% |
| hanks | 12 | 0.6% |
| ethan | 11 | 0.5% |
| de | 11 | 0.5% |
| Other values (1112) | 1898 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1239 | 9.5% |
| e | 1088 | 8.4% |
| 1048 | 8.1% | |
| n | 951 | 7.3% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.5% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (62) | 4808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9784 | |
| Uppercase Letter | 2099 | 16.2% |
| Space Separator | 1048 | 8.1% |
| Dash Punctuation | 32 | 0.2% |
| Other Punctuation | 29 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1239 | |
| e | 1088 | |
| n | 951 | |
| r | 816 | 8.3% |
| i | 794 | 8.1% |
| o | 767 | 7.8% |
| l | 590 | 6.0% |
| t | 453 | 4.6% |
| s | 438 | 4.5% |
| h | 424 | 4.3% |
| Other values (29) | 2224 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 187 | 8.9% |
| M | 172 | 8.2% |
| J | 144 | 6.9% |
| D | 142 | 6.8% |
| B | 142 | 6.8% |
| S | 141 | 6.7% |
| R | 126 | 6.0% |
| A | 115 | 5.5% |
| H | 106 | 5.1% |
| T | 104 | 5.0% |
| Other values (19) | 720 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19 | |
| ' | 10 |
Space Separator
| Value | Count | Frequency (%) |
| 1048 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11883 | |
| Common | 1109 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1239 | 10.4% |
| e | 1088 | 9.2% |
| n | 951 | 8.0% |
| r | 816 | 6.9% |
| i | 794 | 6.7% |
| o | 767 | 6.5% |
| l | 590 | 5.0% |
| t | 453 | 3.8% |
| s | 438 | 3.7% |
| h | 424 | 3.6% |
| Other values (58) | 4323 |
Common
| Value | Count | Frequency (%) |
| 1048 | ||
| - | 32 | 2.9% |
| . | 19 | 1.7% |
| ' | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12937 | |
| None | 55 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1239 | 9.6% |
| e | 1088 | 8.4% |
| 1048 | 8.1% | |
| n | 951 | 7.4% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.6% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (45) | 4753 |
None
| Value | Count | Frequency (%) |
| ô | 13 | |
| é | 7 | |
| ü | 6 | |
| í | 6 | |
| û | 4 | 7.3% |
| ö | 4 | 7.3% |
| è | 3 | 5.5% |
| å | 2 | 3.6% |
| ë | 2 | 3.6% |
| Ç | 1 | 1.8% |
| Other values (7) | 7 |
Star2
Text
| Distinct | 840 |
|---|---|
| Distinct (%) | 84.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.7 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 13.122122 |
| Min length | 4 |
Unique
| Unique | 728 ? |
|---|---|
| Unique (%) | 72.9% |
Sample
| 1st row | Al Pacino |
|---|---|
| 2nd row | Heath Ledger |
| 3rd row | Robert De Niro |
| 4th row | Lee J. Cobb |
| 5th row | Viggo Mortensen |
| Value | Count | Frequency (%) |
| john | 21 | 1.0% |
| robert | 16 | 0.8% |
| lee | 13 | 0.6% |
| michael | 13 | 0.6% |
| emma | 10 | 0.5% |
| james | 9 | 0.4% |
| chris | 9 | 0.4% |
| george | 8 | 0.4% |
| tom | 8 | 0.4% |
| jack | 8 | 0.4% |
| Other values (1388) | 1940 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1319 | 10.1% |
| e | 1180 | 9.0% |
| 1056 | 8.1% | |
| n | 949 | 7.2% |
| r | 885 | 6.8% |
| i | 785 | 6.0% |
| o | 719 | 5.5% |
| l | 579 | 4.4% |
| t | 483 | 3.7% |
| s | 432 | 3.3% |
| Other values (59) | 4722 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9904 | |
| Uppercase Letter | 2101 | 16.0% |
| Space Separator | 1056 | 8.1% |
| Dash Punctuation | 24 | 0.2% |
| Other Punctuation | 24 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1319 | |
| e | 1180 | |
| n | 949 | |
| r | 885 | |
| i | 785 | 7.9% |
| o | 719 | 7.3% |
| l | 579 | 5.8% |
| t | 483 | 4.9% |
| s | 432 | 4.4% |
| h | 375 | 3.8% |
| Other values (28) | 2198 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 194 | 9.2% |
| S | 161 | 7.7% |
| J | 158 | 7.5% |
| C | 136 | 6.5% |
| A | 124 | 5.9% |
| B | 122 | 5.8% |
| R | 122 | 5.8% |
| H | 108 | 5.1% |
| K | 105 | 5.0% |
| D | 105 | 5.0% |
| Other values (17) | 766 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 15 | |
| ' | 9 |
Space Separator
| Value | Count | Frequency (%) |
| 1056 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12005 | |
| Common | 1104 | 8.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1319 | 11.0% |
| e | 1180 | 9.8% |
| n | 949 | 7.9% |
| r | 885 | 7.4% |
| i | 785 | 6.5% |
| o | 719 | 6.0% |
| l | 579 | 4.8% |
| t | 483 | 4.0% |
| s | 432 | 3.6% |
| h | 375 | 3.1% |
| Other values (55) | 4299 |
Common
| Value | Count | Frequency (%) |
| 1056 | ||
| - | 24 | 2.2% |
| . | 15 | 1.4% |
| ' | 9 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13046 | |
| None | 63 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1319 | 10.1% |
| e | 1180 | 9.0% |
| 1056 | 8.1% | |
| n | 949 | 7.3% |
| r | 885 | 6.8% |
| i | 785 | 6.0% |
| o | 719 | 5.5% |
| l | 579 | 4.4% |
| t | 483 | 3.7% |
| s | 432 | 3.3% |
| Other values (45) | 4659 |
None
| Value | Count | Frequency (%) |
| é | 19 | |
| ô | 9 | |
| ö | 7 | 11.1% |
| ç | 6 | 9.5% |
| í | 5 | 7.9% |
| ü | 5 | 7.9% |
| è | 3 | 4.8% |
| Ö | 2 | 3.2% |
| á | 2 | 3.2% |
| ó | 1 | 1.6% |
| Other values (4) | 4 | 6.3% |
Star3
Text
| Distinct | 890 |
|---|---|
| Distinct (%) | 89.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.4 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 21 |
| Mean length | 13.283283 |
| Min length | 4 |
Unique
| Unique | 808 ? |
|---|---|
| Unique (%) | 80.9% |
Sample
| 1st row | James Caan |
|---|---|
| 2nd row | Aaron Eckhart |
| 3rd row | Robert Duvall |
| 4th row | Martin Balsam |
| 5th row | Ian McKellen |
| Value | Count | Frequency (%) |
| john | 21 | 1.0% |
| robert | 16 | 0.8% |
| michael | 13 | 0.6% |
| richard | 12 | 0.6% |
| christopher | 9 | 0.4% |
| jack | 8 | 0.4% |
| paul | 8 | 0.4% |
| george | 7 | 0.3% |
| lee | 7 | 0.3% |
| harris | 7 | 0.3% |
| Other values (1460) | 1951 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1290 | 9.7% |
| e | 1152 | 8.7% |
| 1060 | 8.0% | |
| n | 925 | 7.0% |
| i | 894 | 6.7% |
| r | 864 | 6.5% |
| o | 757 | 5.7% |
| l | 626 | 4.7% |
| t | 443 | 3.3% |
| s | 423 | 3.2% |
| Other values (63) | 4836 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10057 | |
| Uppercase Letter | 2097 | 15.8% |
| Space Separator | 1060 | 8.0% |
| Other Punctuation | 33 | 0.2% |
| Dash Punctuation | 23 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1290 | |
| e | 1152 | |
| n | 925 | |
| i | 894 | 8.9% |
| r | 864 | 8.6% |
| o | 757 | 7.5% |
| l | 626 | 6.2% |
| t | 443 | 4.4% |
| s | 423 | 4.2% |
| h | 392 | 3.9% |
| Other values (33) | 2291 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 189 | 9.0% |
| J | 158 | 7.5% |
| S | 156 | 7.4% |
| C | 142 | 6.8% |
| R | 141 | 6.7% |
| B | 139 | 6.6% |
| A | 119 | 5.7% |
| G | 113 | 5.4% |
| H | 112 | 5.3% |
| K | 107 | 5.1% |
| Other values (16) | 721 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 23 | |
| ' | 10 |
Space Separator
| Value | Count | Frequency (%) |
| 1060 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12154 | |
| Common | 1116 | 8.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1290 | 10.6% |
| e | 1152 | 9.5% |
| n | 925 | 7.6% |
| i | 894 | 7.4% |
| r | 864 | 7.1% |
| o | 757 | 6.2% |
| l | 626 | 5.2% |
| t | 443 | 3.6% |
| s | 423 | 3.5% |
| h | 392 | 3.2% |
| Other values (59) | 4388 |
Common
| Value | Count | Frequency (%) |
| 1060 | ||
| . | 23 | 2.1% |
| - | 23 | 2.1% |
| ' | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13219 | |
| None | 51 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1290 | 9.8% |
| e | 1152 | 8.7% |
| 1060 | 8.0% | |
| n | 925 | 7.0% |
| i | 894 | 6.8% |
| r | 864 | 6.5% |
| o | 757 | 5.7% |
| l | 626 | 4.7% |
| t | 443 | 3.4% |
| s | 423 | 3.2% |
| Other values (45) | 4785 |
None
| Value | Count | Frequency (%) |
| é | 11 | |
| ü | 5 | |
| á | 5 | |
| í | 4 | 7.8% |
| ô | 4 | 7.8% |
| û | 4 | 7.8% |
| ó | 3 | 5.9% |
| å | 2 | 3.9% |
| ö | 2 | 3.9% |
| ç | 2 | 3.9% |
| Other values (8) | 9 |
Star4
Text
| Distinct | 938 |
|---|---|
| Distinct (%) | 93.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 71.0 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 13.211211 |
| Min length | 4 |
Unique
| Unique | 881 ? |
|---|---|
| Unique (%) | 88.2% |
Sample
| 1st row | Diane Keaton |
|---|---|
| 2nd row | Michael Caine |
| 3rd row | Diane Keaton |
| 4th row | John Fiedler |
| 5th row | Orlando Bloom |
| Value | Count | Frequency (%) |
| john | 25 | 1.2% |
| michael | 15 | 0.7% |
| james | 12 | 0.6% |
| lee | 9 | 0.4% |
| richard | 9 | 0.4% |
| mark | 8 | 0.4% |
| bill | 8 | 0.4% |
| martin | 7 | 0.3% |
| charles | 7 | 0.3% |
| kim | 7 | 0.3% |
| Other values (1557) | 1960 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1282 | 9.7% |
| e | 1127 | 8.5% |
| 1068 | 8.1% | |
| n | 903 | 6.8% |
| r | 901 | 6.8% |
| i | 861 | 6.5% |
| o | 710 | 5.4% |
| l | 631 | 4.8% |
| s | 445 | 3.4% |
| t | 419 | 3.2% |
| Other values (63) | 4851 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9956 | |
| Uppercase Letter | 2113 | 16.0% |
| Space Separator | 1068 | 8.1% |
| Dash Punctuation | 32 | 0.2% |
| Other Punctuation | 29 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1282 | |
| e | 1127 | |
| n | 903 | 9.1% |
| r | 901 | 9.0% |
| i | 861 | 8.6% |
| o | 710 | 7.1% |
| l | 631 | 6.3% |
| s | 445 | 4.5% |
| t | 419 | 4.2% |
| h | 409 | 4.1% |
| Other values (30) | 2268 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 200 | 9.5% |
| S | 173 | 8.2% |
| B | 166 | 7.9% |
| J | 161 | 7.6% |
| R | 135 | 6.4% |
| C | 134 | 6.3% |
| A | 117 | 5.5% |
| K | 112 | 5.3% |
| D | 104 | 4.9% |
| L | 101 | 4.8% |
| Other values (19) | 710 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20 | |
| ' | 9 |
Space Separator
| Value | Count | Frequency (%) |
| 1068 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12069 | |
| Common | 1129 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1282 | 10.6% |
| e | 1127 | 9.3% |
| n | 903 | 7.5% |
| r | 901 | 7.5% |
| i | 861 | 7.1% |
| o | 710 | 5.9% |
| l | 631 | 5.2% |
| s | 445 | 3.7% |
| t | 419 | 3.5% |
| h | 409 | 3.4% |
| Other values (59) | 4381 |
Common
| Value | Count | Frequency (%) |
| 1068 | ||
| - | 32 | 2.8% |
| . | 20 | 1.8% |
| ' | 9 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13136 | |
| None | 62 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1282 | 9.8% |
| e | 1127 | 8.6% |
| 1068 | 8.1% | |
| n | 903 | 6.9% |
| r | 901 | 6.9% |
| i | 861 | 6.6% |
| o | 710 | 5.4% |
| l | 631 | 4.8% |
| s | 445 | 3.4% |
| t | 419 | 3.2% |
| Other values (46) | 4789 |
None
| Value | Count | Frequency (%) |
| é | 12 | |
| ô | 11 | |
| ö | 9 | |
| û | 5 | |
| á | 4 | 6.5% |
| è | 3 | 4.8% |
| ø | 2 | 3.2% |
| å | 2 | 3.2% |
| Á | 2 | 3.2% |
| ë | 2 | 3.2% |
| Other values (7) | 10 |
Votes
Real number (ℝ)
High correlation 
| Distinct | 998 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 271621.42 |
| Minimum | 25088 |
|---|---|
| Maximum | 2303232 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 25088 |
|---|---|
| 5-th percentile | 29680 |
| Q1 | 55471.5 |
| median | 138356 |
| Q3 | 373167.5 |
| 95-th percentile | 939289.9 |
| Maximum | 2303232 |
| Range | 2278144 |
| Interquartile range (IQR) | 317696 |
Descriptive statistics
| Standard deviation | 320912.62 |
|---|---|
| Coefficient of variation (CV) | 1.1814702 |
| Kurtosis | 6.041324 |
| Mean | 271621.42 |
| Median Absolute Deviation (MAD) | 98475 |
| Skewness | 2.1943511 |
| Sum | 2.713498 × 108 |
| Variance | 1.0298491 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65341 | 2 | 0.2% |
| 171640 | 1 | 0.1% |
| 699256 | 1 | 0.1% |
| 32802 | 1 | 0.1% |
| 93878 | 1 | 0.1% |
| 1213505 | 1 | 0.1% |
| 51853 | 1 | 0.1% |
| 1642758 | 1 | 0.1% |
| 2067042 | 1 | 0.1% |
| 1854740 | 1 | 0.1% |
| Other values (988) | 988 |
| Value | Count | Frequency (%) |
| 25088 | 1 | |
| 25198 | 1 | |
| 25229 | 1 | |
| 25312 | 1 | |
| 25344 | 1 | |
| 25938 | 1 | |
| 26337 | 1 | |
| 26402 | 1 | |
| 26429 | 1 | |
| 26457 | 1 |
| Value | Count | Frequency (%) |
| 2303232 | 1 | |
| 2067042 | 1 | |
| 1854740 | 1 | |
| 1826188 | 1 | |
| 1809221 | 1 | |
| 1676426 | 1 | |
| 1661481 | 1 | |
| 1642758 | 1 | |
| 1620367 | 1 | |
| 1516346 | 1 |
Revenue
Real number (ℝ)
High correlation  Missing 
| Distinct | 822 |
|---|---|
| Distinct (%) | 99.0% |
| Missing | 169 |
| Missing (%) | 16.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68082574 |
| Minimum | 1305 |
|---|---|
| Maximum | 9.3666222 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1305 |
|---|---|
| 5-th percentile | 139783.9 |
| Q1 | 3245338.5 |
| median | 23457440 |
| Q3 | 80876340 |
| 95-th percentile | 2.9163069 × 108 |
| Maximum | 9.3666222 × 108 |
| Range | 9.3666092 × 108 |
| Interquartile range (IQR) | 77631002 |
Descriptive statistics
| Standard deviation | 1.0980755 × 108 |
|---|---|
| Coefficient of variation (CV) | 1.6128584 |
| Kurtosis | 13.894054 |
| Mean | 68082574 |
| Median Absolute Deviation (MAD) | 22698854 |
| Skewness | 3.1277452 |
| Sum | 5.6508537 × 1010 |
| Variance | 1.2057699 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4360000 | 5 | 0.5% |
| 5321508 | 2 | 0.2% |
| 5450000 | 2 | 0.2% |
| 9600000 | 2 | 0.2% |
| 25000000 | 2 | 0.2% |
| 216540909 | 1 | 0.1% |
| 49530280 | 1 | 0.1% |
| 78756177 | 1 | 0.1% |
| 292576195 | 1 | 0.1% |
| 30500000 | 1 | 0.1% |
| Other values (812) | 812 | |
| (Missing) | 169 | 16.9% |
| Value | Count | Frequency (%) |
| 1305 | 1 | |
| 3296 | 1 | |
| 3600 | 1 | |
| 6013 | 1 | |
| 6460 | 1 | |
| 7461 | 1 | |
| 8060 | 1 | |
| 10177 | 1 | |
| 10950 | 1 | |
| 12562 | 1 |
| Value | Count | Frequency (%) |
| 936662225 | 1 | |
| 858373000 | 1 | |
| 760507625 | 1 | |
| 678815482 | 1 | |
| 659325379 | 1 | |
| 623279547 | 1 | |
| 608581744 | 1 | |
| 534858444 | 1 | |
| 532177324 | 1 | |
| 448139099 | 1 |
Interactions
Correlations
| Certificate | Rating | Revenue | Runtime | Unnamed: 0 | Votes | Year | scoreAvg | |
|---|---|---|---|---|---|---|---|---|
| Certificate | 1.000 | 0.000 | 0.063 | 0.141 | 0.072 | 0.057 | 0.302 | 0.088 |
| Rating | 0.000 | 1.000 | -0.050 | 0.210 | -0.992 | 0.212 | -0.127 | 0.285 |
| Revenue | 0.063 | -0.050 | 1.000 | 0.178 | 0.036 | 0.700 | 0.175 | -0.100 |
| Runtime | 0.141 | 0.210 | 0.178 | 1.000 | -0.233 | 0.157 | 0.194 | -0.090 |
| Unnamed: 0 | 0.072 | -0.992 | 0.036 | -0.233 | 1.000 | -0.245 | 0.012 | -0.259 |
| Votes | 0.057 | 0.212 | 0.700 | 0.157 | -0.245 | 1.000 | 0.255 | -0.073 |
| Year | 0.302 | -0.127 | 0.175 | 0.194 | 0.012 | 0.255 | 1.000 | -0.264 |
| scoreAvg | 0.088 | 0.285 | -0.100 | -0.090 | -0.259 | -0.073 | -0.264 | 1.000 |
Missing values
Sample
| Unnamed: 0 | Title | Year | Certificate | Runtime | Genre | Rating | Overview | scoreAvg | Director | Star1 | Star2 | Star3 | Star4 | Votes | Revenue | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | The Godfather | 1972 | A | 175 | Crime, Drama | 9.2 | An organized crime dynasty's aging patriarch transfers control of his clandestine empire to his reluctant son. | 100 | Francis Ford Coppola | Marlon Brando | Al Pacino | James Caan | Diane Keaton | 1620367 | 134966411.0 |
| 1 | 2 | The Dark Knight | 2008 | UA | 152 | Action, Crime, Drama | 9.0 | When the menace known as the Joker wreaks havoc and chaos on the people of Gotham, Batman must accept one of the greatest psychological and physical tests of his ability to fight injustice. | 84 | Christopher Nolan | Christian Bale | Heath Ledger | Aaron Eckhart | Michael Caine | 2303232 | 534858444.0 |
| 2 | 3 | The Godfather: Part II | 1974 | A | 202 | Crime, Drama | 9.0 | The early life and career of Vito Corleone in 1920s New York City is portrayed, while his son, Michael, expands and tightens his grip on the family crime syndicate. | 90 | Francis Ford Coppola | Al Pacino | Robert De Niro | Robert Duvall | Diane Keaton | 1129952 | 57300000.0 |
| 3 | 4 | 12 Angry Men | 1957 | U | 96 | Crime, Drama | 9.0 | A jury holdout attempts to prevent a miscarriage of justice by forcing his colleagues to reconsider the evidence. | 96 | Sidney Lumet | Henry Fonda | Lee J. Cobb | Martin Balsam | John Fiedler | 689845 | 4360000.0 |
| 4 | 5 | The Lord of the Rings: The Return of the King | 2003 | U | 201 | Action, Adventure, Drama | 8.9 | Gandalf and Aragorn lead the World of Men against Sauron's army to draw his gaze from Frodo and Sam as they approach Mount Doom with the One Ring. | 94 | Peter Jackson | Elijah Wood | Viggo Mortensen | Ian McKellen | Orlando Bloom | 1642758 | 377845905.0 |
| 5 | 6 | Pulp Fiction | 1994 | A | 154 | Crime, Drama | 8.9 | The lives of two mob hitmen, a boxer, a gangster and his wife, and a pair of diner bandits intertwine in four tales of violence and redemption. | 94 | Quentin Tarantino | John Travolta | Uma Thurman | Samuel L. Jackson | Bruce Willis | 1826188 | 107928762.0 |
| 6 | 7 | Schindler's List | 1993 | A | 195 | Biography, Drama, History | 8.9 | In German-occupied Poland during World War II, industrialist Oskar Schindler gradually becomes concerned for his Jewish workforce after witnessing their persecution by the Nazis. | 94 | Steven Spielberg | Liam Neeson | Ralph Fiennes | Ben Kingsley | Caroline Goodall | 1213505 | 96898818.0 |
| 7 | 8 | Inception | 2010 | UA | 148 | Action, Adventure, Sci-Fi | 8.8 | A thief who steals corporate secrets through the use of dream-sharing technology is given the inverse task of planting an idea into the mind of a C.E.O. | 74 | Christopher Nolan | Leonardo DiCaprio | Joseph Gordon-Levitt | Elliot Page | Ken Watanabe | 2067042 | 292576195.0 |
| 8 | 9 | Fight Club | 1999 | A | 139 | Drama | 8.8 | An insomniac office worker and a devil-may-care soapmaker form an underground fight club that evolves into something much, much more. | 66 | David Fincher | Brad Pitt | Edward Norton | Meat Loaf | Zach Grenier | 1854740 | 37030102.0 |
| 9 | 10 | The Lord of the Rings: The Fellowship of the Ring | 2001 | U | 178 | Action, Adventure, Drama | 8.8 | A meek Hobbit from the Shire and eight companions set out on a journey to destroy the powerful One Ring and save Middle-earth from the Dark Lord Sauron. | 92 | Peter Jackson | Elijah Wood | Ian McKellen | Orlando Bloom | Sean Bean | 1661481 | 315544750.0 |
| Unnamed: 0 | Title | Year | Certificate | Runtime | Genre | Rating | Overview | scoreAvg | Director | Star1 | Star2 | Star3 | Star4 | Votes | Revenue | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 989 | 990 | Giù la testa | 1971 | PG | 157 | Drama, War, Western | 7.6 | A low-life bandit and an I.R.A. explosives expert rebel against the government and become heroes of the Mexican Revolution. | 77 | Sergio Leone | Rod Steiger | James Coburn | Romolo Valli | Maria Monti | 30144 | 696690.0 |
| 990 | 991 | Kelly's Heroes | 1970 | GP | 144 | Adventure, Comedy, War | 7.6 | A group of U.S. soldiers sneaks across enemy lines to get their hands on a secret stash of Nazi treasure. | 50 | Brian G. Hutton | Clint Eastwood | Telly Savalas | Don Rickles | Carroll O'Connor | 45338 | 1378435.0 |
| 991 | 992 | The Jungle Book | 1967 | U | 78 | Animation, Adventure, Family | 7.6 | Bagheera the Panther and Baloo the Bear have a difficult time trying to convince a boy to leave the jungle for human civilization. | 65 | Wolfgang Reitherman | Phil Harris | Sebastian Cabot | Louis Prima | Bruce Reitherman | 166409 | 141843612.0 |
| 992 | 993 | Blowup | 1966 | A | 111 | Drama, Mystery, Thriller | 7.6 | A fashion photographer unknowingly captures a death on film after following two lovers in a park. | 82 | Michelangelo Antonioni | David Hemmings | Vanessa Redgrave | Sarah Miles | John Castle | 56513 | NaN |
| 993 | 994 | A Hard Day's Night | 1964 | U | 87 | Comedy, Music, Musical | 7.6 | Over two "typical" days in the life of The Beatles, the boys struggle to keep themselves and Sir Paul McCartney's mischievous grandfather in check while preparing for a live television performance. | 96 | Richard Lester | John Lennon | Paul McCartney | George Harrison | Ringo Starr | 40351 | 13780024.0 |
| 994 | 995 | Breakfast at Tiffany's | 1961 | A | 115 | Comedy, Drama, Romance | 7.6 | A young New York socialite becomes interested in a young man who has moved into her apartment building, but her past threatens to get in the way. | 76 | Blake Edwards | Audrey Hepburn | George Peppard | Patricia Neal | Buddy Ebsen | 166544 | NaN |
| 995 | 996 | Giant | 1956 | G | 201 | Drama, Western | 7.6 | Sprawling epic covering the life of a Texas cattle rancher and his family and associates. | 84 | George Stevens | Elizabeth Taylor | Rock Hudson | James Dean | Carroll Baker | 34075 | NaN |
| 996 | 997 | From Here to Eternity | 1953 | Passed | 118 | Drama, Romance, War | 7.6 | In Hawaii in 1941, a private is cruelly punished for not boxing on his unit's team, while his captain's wife and second-in-command are falling in love. | 85 | Fred Zinnemann | Burt Lancaster | Montgomery Clift | Deborah Kerr | Donna Reed | 43374 | 30500000.0 |
| 997 | 998 | Lifeboat | 1944 | NaN | 97 | Drama, War | 7.6 | Several survivors of a torpedoed merchant ship in World War II find themselves in the same lifeboat with one of the crew members of the U-boat that sank their ship. | 78 | Alfred Hitchcock | Tallulah Bankhead | John Hodiak | Walter Slezak | William Bendix | 26471 | NaN |
| 998 | 999 | The 39 Steps | 1935 | NaN | 86 | Crime, Mystery, Thriller | 7.6 | A man in London tries to help a counter-espionage Agent. But when the Agent is killed, and the man stands accused, he must go on the run to save himself and stop a spy ring which is trying to steal top secret information. | 93 | Alfred Hitchcock | Robert Donat | Madeleine Carroll | Lucie Mannheim | Godfrey Tearle | 51853 | NaN |